National Repository of Grey Literature 23 records found  1 - 10nextend  jump to record: Search took 0.01 seconds. 
Automatically Updated Bibliography
Valo, Boris ; Škoda, Petr (referee) ; Smrž, Pavel (advisor)
This paper describes the development of application for automatically updated bibliography. Nowadays, many Internet users search informations they need, this is important especially in sets of scientific publications and articles. The aim of this thesis is convenient tool for users to create their own portal. This is achieved by storing documents and their subsequent search using ElasticSearch. Retrieval is made by Boolean queries and additional search using similarity search tool MoreLikeThis. At the end of this thesis is described the way of testing and evaluation of retrieval.
Index Suitable for Similar Search in High-dimensional Spaces
Krejčová, Martina ; Kopecký, Michal (advisor) ; Skopal, Tomáš (referee)
In this paper, we focus on indexing and searching in high-dimensional data. To achieve the target we implemented the Metric Index, a model of the similarity search based on the metric spaces, that employs many of known principles of partitioning and filtering. The metric space is a general model of similarity, which enables the usage of implemented index for various data. With this index, stored data could be searched effectively. The internal structure of data is hidden, we just require an implementation of the function for feature extraction, which produces a vector representing data, and the metric function applicable to the given data. The Metric Index was implemented as a data cartridge, the mechanism for extending the capabilities of the Oracle server. This data cartridge enables indexing of large unstructured data in the Oracle server known as LOBs.
Podobnostní vyhledávání obrázků na webu
Grošup, Tomáš ; Lokoč, Jakub (advisor) ; Hoksza, David (referee)
The subject of this bachelor thesis is to design and create a web portal, enabling efficient indexing and content-based searching of images obtained from various free image databases (e.g., results from a keyword-based search engine). The portal provides fast feature extraction technique and for the visual similarity, the signature quadratic form distance is utilized. The search supports various user settings and comparison of their results. Search results can also be presented using a layout based on particle physics, which supports exploration and multi-query.
Similarity search in image collections
Navrátil, Lukáš ; Bartoš, Tomáš (advisor) ; Skopal, Tomáš (referee)
Detection of keypoints from image and their characterization by using descriptors is common technique in some branches of computer vision. The goal of this thesis is to explore and confirm usability of this technique for similarity retrieval in image collections. For this purpose it will be created a web application used for collecting ratings of similarity from users which will be subsequently compared with results computed by the implementation of SURF algorithm, one of algorithms used for detection and description of image keypoints. It will also be discussed the impact of metrics and parameters influencing results of computation of similarity between images and it will be made an effort to find settings for which computed results will be closest to user's similarity perception.
Similarity search in Mass Spectra Databases
Novák, Jiří ; Skopal, Tomáš (advisor) ; Svozil, Daniel (referee) ; Nahnsen, Sven (referee)
Shotgun proteomics is a widely known technique for identification of protein and peptide sequences from an "in vitro" sample. A tandem mass spectrometer generates tens of thousands of mass spectra which must be annotated with peptide sequences. For this purpose, the similarity search in a database of theoretical spectra generated from a database of known protein sequences can be utilized. Since the sizes of databases grow rapidly in recent years, there is a demand for utilization of various database indexing techniques. We investigate the capabilities of (non)metric access methods as the database indexing techniques for fast and approximate similarity retrieval in mass spectra databases. We show that the method for peptide sequences identification is more than 100x faster than a sequential scan over the entire database while more than 90% of spectra are correctly annotated with peptide sequences. Since the method is currently suitable for small mixtures of proteins, we also utilize a precursor mass filter as the database indexing technique for complex mixtures of proteins. The precursor mass filter followed by ranking of spectra by a modification of the parametrized Hausdorff distance outperforms state-of-the-art tools in the number of identified peptide sequences and the speed of search. The...
Similarity Search in Protein Structure Databases
Galgonek, Jakub ; Skopal, Tomáš (advisor) ; Porto, Markus (referee) ; Svozil, Daniel (referee)
Proteins are one of the most important biopolymers having a wide range of functions in living organisms. Their huge functional diversity is achieved by their ability to fold into various 3D structures. Moreover, it has been shown that proteins sharing similar structure often share also other properties (e.g, a biological function, an evolutionary origin, etc.). Therefore, protein structures and methods to identify their similarities are so widely studied. In this thesis, we introduce a system allowing similarity search in pro- tein structure databases. The system retrieves, given a query structure, all database structures being similar to the query structure. It employs several key components. We have introduced a novel similarity measure assigning similarity scores to pairs of protein structures. We have designed specific access method based on LAESA metric indexing and using the proposed measure. The access method allows to search similar structures more effi- ciently than when a sequential scan of a database is employed. To achieve further speedup, the measure and the access method have been parallelized, resulting in almost linear speedup with the respect to the number of available cores. The last component is a web user interface that allows to accept a query structure and to present a list of...
Employing Parallel Architectures in Similarity Search
Kruliš, Martin ; Yaghob, Jakub (advisor) ; Platoš, Jan (referee) ; Pllana, Sabri (referee)
This work examines the possibilities of employing highly parallel architectures in database systems, which are based on the similarity search paradigm. The main objective of our research is utilizing the computational power of current GPU devices for similarity search in the databases of images. Despite leaping progress made in the past few years, the similarity search problems remain very expensive from a compu- tational point of view, which limits the scope of their applicability. GPU devices have a tremendous computational power at their disposal; however, the usability of this power for particular problems is often complicated due to the specific properties of this architecture. Therefore, the existing algorithms and data structures require extensive modifications if they are to be adapted for the GPUs. We have addressed all the aspects of this domain, such as efficient utilization of the GPU hardware for generic computations, parallelization of similarity search process, and acceleration of image indexing techniques. In most cases, employing the GPU devices brought a speedup of two orders of magnitude with respect to single-core CPUs and approximately one order of magnitude with respect to multiprocessor NUMA servers. This thesis summarizes our experience and discoveries from several years of research,...
Similarity Search in Protein Structure Databases
Galgonek, Jakub
Proteins are one of the most important biopolymers having a wide range of functions in living organisms. Their huge functional diversity is achieved by their ability to fold into various 3D structures. Moreover, it has been shown that proteins sharing similar structure often share also other properties (e.g, a biological function, an evolutionary origin, etc.). Therefore, protein structures and methods to identify their similarities are so widely studied. In this thesis, we introduce a system allowing similarity search in pro- tein structure databases. The system retrieves, given a query structure, all database structures being similar to the query structure. It employs several key components. We have introduced a novel similarity measure assigning similarity scores to pairs of protein structures. We have designed specific access method based on LAESA metric indexing and using the proposed measure. The access method allows to search similar structures more effi- ciently than when a sequential scan of a database is employed. To achieve further speedup, the measure and the access method have been parallelized, resulting in almost linear speedup with the respect to the number of available cores. The last component is a web user interface that allows to accept a query structure and to present a list of...
Content-based exploration of unstructured data
Čech, Přemysl ; Lokoč, Jakub (advisor) ; Barthel, Kai Uwe (referee) ; Gudmundsson, Gylfi Thor (referee)
Effective analysis, searching and browsing throughout arbitrary multimedia collections is still a challenging task. To perform a search among multimedia objects, first, a similarity model has to be defined. Such a model establishes methods describing how the content of individual objects is processed and how key features and descriptors, that are used for modeling similarity between objects, are formed. This task is not trivial since there can be many ways of determining how to comprehend the content of multimedia data. Furthermore, with the growing size of contemporary database collections, multimedia retrieval and exploration are extremely computationally intensive. Hence, researchers investigate support indexing structures that can evaluate similarity queries and can respond to user's queries in almost real-time even on datasets counting billions of objects. Another very important aspect of a retrieval system is the user interface for defining queries as well as presenting retrieved results. A multimedia system should offer various inputs for formulating user's queries, especially for situations in which a user cannot provide an ideal query example. Finally, a well- arranged and easy to read interface for visualization of retrieved results is essential for the success of a multimedia exploration and...
Comparison of signature-based and semantic similarity models
Kovalčík, Gregor ; Lokoč, Jakub (advisor) ; Mráz, František (referee)
Content-based image retrieval and similarity search has been investigated for several decades with many different approaches proposed. This thesis fo- cuses on a comparison of two orthogonal similarity models on two different im- age retrieval tasks. More specifically, traditional image representation models based on feature signatures are compared with models based on state-of-the-art deep convolutional neural networks. Query-by-example benchmarking and tar- get browsing tasks were selected for the comparison. In a thorough experimental evaluation, we confirm that models based on deep convolutional neural networks outperform the traditional models. However, in the target browsing scenario, we show that the traditional models could still represent an effective option. We have also implemented a feature signature extractor into the OpenCV library in order to make the source codes available for the image retrieval and computer vision community. 1

National Repository of Grey Literature : 23 records found   1 - 10nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.